Discrimination strategies for inequivalent classes of multipartite entangled states 
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How can one discriminate different inequivalent classes of multiparticle entanglement experimen- 
tally? We present an approach for the discrimination of an experimentally prepared state from the 
equivalence class of another state. We consider two possible measures for the discrimination strength 
of an observable. The first measure is based on the difference of expectation values, the second on the 
relative entropy of the probability distributions of the measurement outcomes. The interpretation 
of these measures and their usefulness for experiments with limited resources are discussed. In the 
case of graph states, the stabilizer formalism is employed to compute these quantities and to find 
sets of observables that result in the most decisive discrimination. 
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I. INTRODUCTION 

With the rapid progress of quantum control, the ex- 
perimental creation of a variety of multiparticle entan- 
gled states has become feasible 0, Q ■ When more than 
two particles are entangled, it is well known that there are 
different and inequivalent entanglement classes, but there 
are even different classification schemes: In a first ap- 
proach, one may consider two states I';/') and as equiv- 
alent, if one can be converted into the other by changing 
the local bases only. These operations are called local 
unitary (LU) operations and recently a method to decide 
whether two states are LU equivalent or not has been 
found d, H[ • Another classification is based on the ques- 
tion regarding whether a single copy of can be con- 
verted into \4>) by local operations and classical commu- 
nication, even if this conversion works only with a small 
probability [5| . These operations are called stochastic lo- 
cal operations and classical communication (SLOCC). 
Similar to LU equivalent states, states equivalent under 
SLOCC can often be used for the same applications. As 
the number of SLOCC classes is infinite for more than 
three qubits [6], modified classification schemes have been 
proposed [7|, [8| . 

On the theoretical side, it has been shown that differ- 
ent classes of entangled states are suited for different ap- 
plications. For example, cluster states are useful for one- 
way quantum computation, whereas Greenberger-Horne- 
Zeilinger (GHZ) states are not @. To the contrary, for 
sub shot-noise interferometry, GHZ states are opti mally 
suited, while cluster states are useless for this task [lfjj . 
Consequently, it can be important to discriminate exper- 
imentally between the different classes. 

For the experimental verification of entanglement, a 
number of tools exist — the most prominent example are 



witness operators 11| . As experiments no longer aim only 
at the creation of entanglement but also at creating spe- 
cific classes of entangled states, tools are needed for the 
experimental discrimination of these classes. In the con- 
text of entanglement detection, it is well known that a 



given Bell inequality or witness operator detects only a 
part of all entangled states and fails to detect others. 
Thus the violation (or nonviolation) of a Bell inequality 
can provide information not only about the entanglement 
present in a state but also about its type [HI, [l3| . 

Consequently, in Ref. [14] Bell operators have been 
constructed and experimentally implemented for discrim- 
inating different classes of entangled states. For an exper- 
iment aiming at the creation of a particular state, a Bell 
operator characteristic for this state was designed, that 
is, a Bell operator that has the desired state as eigen- 
state with maximal eigenvalue. The maximal expecta- 
tion value of this Bell operator for various other classes 
of states (defined as all LU equivalents or all SLOCC 
equivalents of some prominent entangled state) was de- 
termined. Measuring the Bell operator then proved that 
the prepared state was not in those classes with maximal 
expectation value lower than the experimentally obtained 
value. In this approach, the characteristic operator is far 
from unique. Neither is it necessary to use a Bell oper- 
ator, as has already been remarked in Ref. [lij . As en- 
tangled states with increasingly large numbers of qubits 
are being prepared, analysis tools that give strong re- 
sults in spite of a limited number of measurement events 
are needed. In the context of entanglement detection this 
problem has recently received attention [l5l - [l7j . 

In this article we present an approach for the discrimi- 
nation of an experimentally prepared state from the class 
of all local unitaries of another state. We define two mea- 
sures for the discrimination strength of an observable. 
The first measure is based of the difference of expecta- 
tion values and coincides with the one implicitly used in 
Ref. [ill • An interpretation of this quantity as a noise tol- 
erance is presented. The second measure is based on the 
relative entropy, also called Kullback-Leibler divergence, 
which is a well-established information-theoretic measure 
for the discrepancy of two classical probability distribu- 
tions [H, This quantity is directly related to the 
probability of another state to reproduce the observed 
measurement outcomes in a given number of measure- 
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ment runs. Our use of the relative entropy is motivated 
by a work of van Dam et al., where it was used to assess 
the statistical strength of nonlocality proofs [la ]. In the 
case of graph states, the stabilizer formalism helps us to 
compute these quantities and to find sets of observables 
that result in the strongest discrimination. We would like 
to add that our approach is not directly related to the 
task of state discrimination as it is often discussed in 
the literature [2(| ■ In particular we do not assume the 
promise that the state is either in the first or in the sec- 
ond family; such an assumption cannot be justified in an 
experiment aiming for the verification of entanglement 
properties. 

This article is organized as follows: In Section|n]we in- 
troduce the two measures for the discrimination strength. 
In Sections IIIII and IIVI we calculate these quantities for 
certain four- and three-qubit states and find optimal sets 
of observables for the discrimination task. The perfor- 
mance of the measures for experimental data and noisy 
states is investigated. In Section [V] a general result for 
the discrimination of graph states is presented. Finally, 
Section [VI] is devoted to a discussion of the results. 

II. THE SITUATION AND THE DISTANCE 
MEASURES 

We consider the following situation: In an experiment 
aiming at the preparation of a state g the experimenter 
wants to verify that the prepared state is not in a certain 
class of undesired states, given by all local unitaries (and 
maybe permutations of qubits) of a pure state \4>). For 
that, he can measure an observable A (or several observ- 
ables Ai) and one has to define to what extent such a 
measurement can exclude the undesired states. 

In this section we will define two quantities that mea- 
sure how well an observable A discriminates a state g 
from all local unitaries of another state \<fi). While we 
are restricting our attention to LU classes in this article, 
it should be noted that the same quantities can also be 
defined for SLOCC classes. 

A. A measure based on the fidelity 

In analogy to the approach taken in Ref. [l4| we define 
the fidelity based measure as 

T A (qU) = min \Tx{A Q )-{ct>\U^AU\ct>)\. (1) 

Here the minimization is over all local unitaries XJ\ later 
we will consider also the minimization over all permuta- 
tions of qubits. 

Adding or subtracting a multiple of the identity matrix 
to A does not change the value of F in Eq. ([T]) . So without 
loss of generality we can assume Tr(A) = 0. Also without 
loss of generality we assume Tr(Ag) > 0. 

In the following we will always assume that Ti(Ag) > 
max-Lu (4>\A\<fi) . This is no restriction for our purposes 



due to the following reasoning: Let us assume that for 
the pure n-qubit state \<fi) there exist 2™ local unitaries 
Ui such that {Ui\(j>)} forms an orthonormal basis. States 
with this property are called locally encodeable [2l|. It 
has been conjectured that all pure states are locally en- 
codeable, and the conjecture has been proven for a va- 
riety of states, including all stabilizer states and the W 
state [2l|. Then we can write 1 = Y.iUi^^Wi and 
since A is traceless we have Tr(^4^ i Ui\4>){(j)\Ul) = 0. 
So, if (</>|A|(/)) < there exists a local unitary U such 
that (cj>\W AU\4>) > and vice versa. If max LU ((/>|,4|</>) > 
Tr(Ag), by local encodeability there exists a local unitary 
U such that Tr (Ag) > > {<j>\W AU\4>). By continuity 
there exists another local unitary such that Tr(Ag) = 
((j>\W AU\<f)) , i.e., ^(^110) = and the observable A is 
not suitable for a discrimination procedure based on T . 
In this article, we shall be concerned with graph states 
and sometimes with the W state, so local encodeability 
is proven for our purposes and we have 

T a {qU) = Tr(Ag) - max(0|^|0). (2) 

This quantity, however, is not invariant under rescaling 
of A. To be able to compare different observables, we 
have to agree on a normalization. We choose Tr(Ag) = 
1, which is the same normalization as in Ref. [14j . and 
obtain 

F A ( Q \\ct>) = l-max{cf>\A\<f>}. (3) 

This is the first quantity that will serve us as a measure 
for the strength with which A discriminates g from all 
local unitaries of 

For more than one observable we define 

Fa u ...,aM\4>) = J7 i i: >? =1 aM\ ( I>)- ( 4 ) 

In the remainder of the article we will discuss how to find 
optimal families of observables A\, . . . , Ak for particular 
states g and \<p). 

Our definition has a direct physical interpretation in 
terms of a noise tolerance. For that, we exploit the sim- 
ilarity of our problem to the task of entanglement de- 
tection by virtue of witness operators and consider the 
robustness of T in Eq. ^ against white noise: Let 

£>wn(p) = (1 -P)-1 +PQ ( 5 ) 

be the state g affected by white noise. The maximal noise 
level (1 — p) such that 

Ty[Ag wn (p)} - max(0|A|</>) > (6) 

LU 

is given by [using Tr(A) = 0] 

l-p=l-max(0|A|^=J- A ( e ||0). (7) 

We will discuss the interpretation of J 7 as a noise toler- 
ance in more detail in Section IIII CI 
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B. A measure based on the relative entropy 



measurement outcomes for A 



From a statistical point of view, the task of discrim- 
inating a state g and a state a — by virtue of 
an observable A is the task of discriminating the corre- 
sponding probability distributions for the measurement 
outcomes of A. 

The relative entropy or Kullback-Leibler divergence is 
a well-established information-theoretic measure for the 
discrepancy between two classical probability distribu- 
tions [Tj| [l9| . The relative entropy of the probability dis- 
tributions P = {pi,.--,Pm} and Q = {qi,...,q m } is 
defined as 



£>(P||Q) = X>io g 



i=l 



•Pis 

-a- ' 



(8) 



We will always use the logarithm to the base of two, 
log = log 2 , and define 01og(0) = 0. Note that since we 
are dealing with the discrimination of classical proba- 
bility distributions, we are not using the quantum (or 
von Neumann) relative entropy [l9j], Tr[g(logp — logo - )]. 
The relative entropy satisfies < D(P\\Q) < oo with 
D(P\\Q) = if and only if P = Q. But although the 
relative entropy behaves in some sense like a distance be- 
tween probability distributions, it is not a metric because 
it is not symmetric. The most important properties of the 
relative entropy are summarized in Appendix [XI 

Concerning the interpretation, the relative entropy 
D(P\\Q) can be used to answer the question: How 
strongly does a sample (of a fixed length) from the dis- 
tribution P on average indicate that it was indeed drawn 
from P rather than from Ql This statement can be made 
precise with the theory of statistical hypothesis test- 
ing 0. 

For the simplest case, note that D(P\\Q) is infinity if 
and only if = 0, but pi > for some i, that is, if 
an event is impossible according to Q, but occurs with 
a nonvanishing probability according to P. This means 
that on observing this event one immediately knows that 
the sample was not drawn from Q. 

More generally, suppose that a sample of length N has 
been drawn from Q. We consider the empirical proba- 
bility distribution P defined by the observed frequencies. 
Then the probability Q N [T(P)] of drawing a sample from 
Q with the same frequencies [i.e., within the type class 
T(P)} decays exponentially for large N [H, Thm. 12.1.4], 



T(P)1 ~ 2" Ar - D(p|l<3) . 



(9) 



Consequently, if one observes a probability distribution 
P yielding a large value for the relative entropy D(P\\Q), 
the assumption that it was rather drawn from the proba- 
bility distribution Q is very questionable (see also below) . 

Let us now return to our original problem: For an ex- 
periment aiming at the preparation of the state g, we de- 
fine a measure for how well the observable A can exclude 
the state a as the relative entropy of the corresponding 



(10) 



where A = 5Zl=i a i^i is the spectral decomposition of 
A. From the above discussion, this is a measure for how 
strongly the measurement results of the observable A on 
the state q on average show that they are due to the 
state q rather than the state a. (Note, that in this inter- 
pretation we assume that the experimental precision in 
implementing the observable A outperforms the precision 
that can be achieved for the preparation of the state q.) 

Let us discuss the interpretation of this quantity. Sup- 
pose that the measurement has been performed, resulting 
in an observed probability distribution P = {pi, . . . ,p m } 
of the outcomes oi, ... ,a m , and let N be the number 
of measurement runs. Then, by Eq. (jH]), the probability 
that a measurement on the state ct, after N measurement 
runs, results in the same frequencies is given by 



Q N [T{P)] ~2 



-ND(P\\Q) 



(11) 



where Q = {Tr(crlli), . . . , Tr(aTI m )}. If the experimen- 
tally prepared state is close enough to the intended state 
g, the relative entropy D(P\\Q) will attain a large value 
only if this is already the case for Da(q\\<?)- 

For comparison, when tossing a fair coin N times, the 
probability of the outcome always being "tails" is 



2 -W_D({l,0}||{i,i}) = 2 -N 



(12) 



since Eq. @ is exact in this example. Thus, the proba- 
bility Eq. (|TT|) of obtaining the frequencies P after mea- 
suring ./V times the state a is equal to the probability of 
always obtaining "tails" in N — ND(P\\Q) tosses of a 
fair coin 15[. In other words, the likelihood after TV mea- 



surement runs that the prepared state is a is the same 
as that of a coin to be fair after N = ND(P\\Q) tosses 
resulting in "tails" . This gives our results for the measure 
D in Eq. (|10p a quantitative interpretation. 

When measuring several observables Ai,...,Ak inde- 
pendently of each other, the relative entropy of the joint 
probability distributions is given by the sum of the rel- 
ative entropies for the individual observables (cf. Prop- 
erty [5] in Appendix |A")) . However, we renormalize the rel- 
ative entropy in this case and define 



k 

D Au .... Ak (g\\a) ^IY^DaMW), 



(13) 



where the prefactor 1/fc corresponds to keeping the over- 
all number of measurement runs constant, independent 
of the number of observables, i. e., each observable Ai will 
be measured in N/k runs. We choose this definition be- 
cause in experiments the rate at which entangled states 
are being created is typically low, so the number of mea- 
surement runs is a scarce resource. 
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Finally, we consider the minimum of D over all local 
unitaries of a, 

V Au ...,aM\°) = ™ D Al _ Ak (g\\UaU^. (14) 

U t ij u 

In the following we will discuss how to find families of 
observables Ai which maximize this quantity. 



III. DISCRIMINATING FOUR-QUBIT STATES 

As our first example, we will calculate the quantities T 
and T> for the discrimination of the four-qubit GHZ state 
from the four-qubit linear cluster state and vice versa. 

The four-qubit GHZ state is given by 



IGHZ, 



1 



V2 



(|0000> + |1111)). 



(15) 



Alternatively, it can be described by its stabilizing oper- 
ators: The GHZ state is the unique common eigenstate 
with eigenvalue +1 of the 16 operators 

Sghz 4 = {1111, 11ZZ and perm., ZZZZ, 

XXXX, -XXYY and perm., YYYY}. (16) 

Here and in the following X, Y, Z, and 1 denote the Pauli 
matrices and the identity, tensor product signs have been 
omitted and "perm" denotes all possible permutations 
of the qubits which give different terms. The set of all 
stabilizing operators forms a commutative group, and the 
GHZ state is an example of a graph state |22|. We will 
explain this in more detail in Section fVl below. The sum 
of all stabilizing operators gives the projector onto the 
state 



|GHZ 4 )(GHZ 4 



1 

16 



(17) 



this property is shared by any graph state. The stabilizing 
operators thus contain a description of the correlations 
present in the state. 

The linear cluster state given by 



|C 4 ) = |(|0000) + 10011) + |1100) - 1 1111)) 
is also a graph state: Its stabilizer group is [22[ 

s Ci = {1111, UZZ, ZZ11, zzzz, 

XYXY, XYYX, YXXY, YXYX, 
1ZXX, Z1XX, XXtZ, XXZt, 
-1ZYY, -Z1YY, -YY1Z, -YYZ1}, 



(18) 



(19) 



and the analogous relation to Eq. (fTT]) holds. 

The stabilizing operators of a graph state \ip) provide 
a natural choice of observables for the discrimination of 
q = from other states. In the language of Ref. [i~4| . 

they are characteristic operators for the graph state. In 
the following we will restrict our analysis to these observ- 
ables. 



A. Discriminating the GHZ state from the cluster 

state 

We first consider the discrimination of the GHZ state 
from all LU equivalents of the cluster state, using all 
stabilizing operators of the former, excluding only the 
identity, as it is useless for any discrimination task. We 
introduce the notation S* — S \ {1} for any stabilizer 
group S minus the identity. Later we will discuss which 
subset of the stabilizer group gives the strongest discrim- 
ination. We start with the calculation of the quantity V 
in Eq. flT4"l) . which is based on the relative entropy. 

It is useful to think of the cluster state as the sum of its 
stabilizing operators |C4)(C4| = jq StsSc ^ ' ^ or an ^ 
GHZ stabilizing operator S € <Sghz 4 ! the term Ds is a 
function of the overlap of S with the stabilizing operators 
of the cluster state 

Z} s (GHZ 4 ||C 4 ) = -log{i[^- £ Tr(ST) + l]}. (20) 

TGSc 

If we do not consider local unitaries, Tr(ST) is zero un- 
less S — T. For the minimization over local unitaries in 
Eq. (fl4)) we classify stabilizing operators by the qubits 
on which they act nontrivially. For any GHZ stabilizing 
operator S, only those stabilizing operators of IC4) which 
act nontrivially on the exactly the same qubits as S can 
have a nonvanishing overlap with S. This still holds if ar- 
bitrary local unitary operations are applied to IC4). We 
can thus identify those stabilizing operators of | C4) that 
can contribute to Ds- 

If no local unitary is applied, the GHZ stabilizing op- 
erators UZZ, ZZtt, and ZZZZ have maximal overlap 
with cluster stabilizing operators and thus give the mini- 
mal relative entropy of zero, while \Z\Z, 1ZZ1, ZUZ, 
and Z1LZ1L each have zero overlap and thus give relative 
entropy of 1. A minimization over local unitaries cannot 
improve this result, as the cluster state has no stabilizing 
operators acting nontrivially on the same qubits. 

All of the remaining stabilizing operators of |GHZ 4 ) 

£ = {XXXX, -XXYY, -YYXX, YYYY, 

-XYXY, -XYYX, -YXXY, —YXYX} (21) 

act on all four qubits (such stabilizing operators describ- 
ing four-point correlations we call four-point stabilizing 
operators for short). We note that both these and the 
four-point stabilizing operators of IC4) except ZZZZ are 
products of local operators X and Y. It is therefore rea- 
sonable to assume that for the minimization of it suf- 
fices to consider rotations about the z axes. The rotated 
cluster state is 



Wl,S)) = \ (loooo) 



|0011) 



+ e- i7 |1100) - e- i(7+,5) |llll)), (22) 

where 7 = ip\ + tp^, 8 — (^3 + ^4, and the ipi are 
the rotation angles about the local z axes, and we ob- 
tain L> S (GHZ 4 ||C 4 (7, 6)) = -|{log[|(l+sirx(7)sm(<5))] + 
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log[4(l — cos(7) cos(5))]}. The minimum of this expres- 
sion is — log(3/4). In conclusion, we have found that 



2>s &HZ (GHZ 4 ||C 4 ) 



1 

15' 



81og : 



0.4880. (23) 



Since our analytic optimization required an assumption 
we would like to add that this result is also obtained via 
numerical minimization over all local unitaries. The 15 
GHZ stabilizing operators do not contribute equally to T>, 
rather, D = for ttZZ, ZZtt, and ZZZZ: D = 1 for 
tZtZ, 1ZZ1, ZttZ, and ZtZt; and D = — log(3/4) w 
0.4150 for all others. 

Let us now turn to the fidelity-based measure J- for the 
same observables and states. If we use all stabilizing op- 
erators of \tp), excluding again only the identity, J r ('0||</>) 
is a function of the fidelity 



2™ / 



(24) 



where n is the number of qubits. 

For our example, we note that for an arbitrary lo- 



cal unitary U we have |(GHZ 4 |£/|C 4 



< 



1/2 and this 



bound can be reached. This follows from the known fact 
that the maximal overlap of the cluster state with any 
product state, and thus with |0000) and 1 1 1 1 1) , is given 
by 1/4 [23[ and one can easily find a local unitary with 
|(GHZ 4 |£/|C 4 )| 2 = 1/2. So we have 



^ HZ4 ( GHZ 4l|C 4 ) = - 



(25) 



as the fidelity-based measure for the discrimination. 

Let us now discuss subsets of the stabilizer group as ob- 
servables for the discrimination. In our previous analysis, 
it turned out that not all stabilizing operators contribute 
equally to the discrimination, in fact, some of them do 
not contribute at all. We therefore ask for families of sta- 
bilizing operators of |GHZ 4 ) that discriminate |GHZ 4 ) 
from the local unitaries of |C 4 ) most strongly, that is, 
families for which J- (or T>) is maximal. 

From the previous discussion, candidates are 



tZtZ, tZZt, zttz, ztzt, 



(26) 



since any of them gives J- = T> = 1. But if we want 
to exclude not only all LU equivalents, but also all per- 
mutations of qubits of the cluster state, we still have to 
minimize T and T> over all permutations, because the set 
of observables is no longer necessarily permutation invari- 
ant. There are three distinct permutations of the cluster 
state, namely IC 1 ) = |C 4 ) = §(|0000) + |0011) + |1100) - 
lllll)), |C 2 ) = ±(|0000) + |0110) + |1001) - |1111», 
and |C 3 ) = |(| 0000) + joiOl) + |1010) - 1 1111)). Ta- 
ble U shows the GHZ stabilizing operators from Eq. (|2"6")l 
along with all stabilizing operators of the permutations 
of |C 4 ) that act on the same qubits. We see that any 
single one of these six stabilizing operators gives a rela- 
tive entropy of zero, if the entropy is minimized over all 



|GHZ 4 ) 


IC 1 ) 


|C 2 > 


|C 3 > 


ttzz 


ttzz 






tztz 






tztz 


tzzt 




tzzt 




zttz 




zttz 




ztzt 






ztzt 


zztt 


zztt 







TABLE I: Stabilizing operators of the GHZ state and stabi- 
lizing operators of the three permutations of the cluster state 
acting on the same qubits (see text for further details). 



permutations. Any pair of stabilizing operators gives an 
entropy of either zero or 1/2. The three-element family 
{ttZZ, tZtZ, tZZt] gives 2/3, in total there are eight 
such families giving the same value. 

It is easy to see that these families of stabilizing opera- 
tors are optimal: It is clear that they are optimal among 
all subsets of the six stabilizing operators in the table. 
Furthermore, we recall that we found a local unitary 
transformation such that all remaining stabilizing oper- 
ators contribute either or — log(3/4) to the entropy. 
Because of the permutation invariance of the set of these 
remaining stabilizing operators, this holds for all permu- 
tations of |C 4 ). As — log(3/4) < 2/3, adding some of 
the remaining observables cannot improve the discrimi- 
nation. This shows the optimality of our three-element 
families. These families are also optimal when using J- 
instead of T>. 

We summarize the main results of this subsection in 
the following observation: 

Observation. For the discrimination of the GHZ state 
from all local unitaries and permutations of qubits of 
the cluster state, using all GHZ stabilizing operators ex- 
cept the identity, the measures T> and T are given by 
Eqs. (|23[) and (|25[) . When considering subsets of the sta- 
bilizer group, {11ZZ, 1Z1Z, tZZt} is an example of 
an optimal family of observables, giving J- =T> = 2/3. 

Finally, let us add that until now we assumed that 
all observables are measured independently. However, as 
the observables in Eq. (f26|). from which we constructed 
the optimal families, have a common eigenbasis of prod- 
uct states (the computational basis), they can also be 
measured jointly in one experiment with more than two 
outcomes. In this case, the relative entropy is no longer 
given by Eq. (|13p . When measuring the computational 
basis in one experiment with 16 outcomes, we obtain 



16 

T> = min > 

U&LU t— 1 
i=l 

= 1. 



|( ei |GHZ 4 )| 2 log( 



|(e,|GHZ 4 )| : 
\(ei\U\C^ 



(27) 



Consequently, considering measurements with more out- 
comes can give a stronger discrimination. This is a con- 
sequence of a general feature of the relative entropy: For 
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each of the observables in Eq. (l26l) , the probability distri- 
bution for the measurement outcomes is obtained from 
the one for the measurement of the computational basis 
by considering several events as one (in other words, by 
"forgetting" information). The relative entropy satisfies 
a grouping rule similar to the Shannon entropy (Prop- 
erty 2] in Appendix \X§ , which implies that this process 
can only decrease the relative entropy. 



B. Discriminating the cluster state from the GHZ 

state 

Let us now consider the reverse discrimination 
f s* (C4IIGHZ4). This turns out to be relatively simple. 

First, note that the eight three-point stabilizing opera- 
tors of IC4) will for any local unitary operation have zero 
overlap with | GHZ4) as the GHZ state has no three-point 
stabilizing operators. For the remaining eight stabilizing 
operators, however, the overlap with the GHZ state can 
brought to 1 by an appropriate rotation, as one can di- 
rectly check. Thus 

2? S£4 (C 4 ||GHZ 4 ) = ^. (28) 

As a function of the fidelity, T is the same as for the 
reverse discrimination 

J- 5£4 (C 4 ||GHZ 4 ) = ^. (29) 

Considering the optimal subsets of the stabilizer group, 
it is clear that any set of three-point stabilizing operators 
of IC4) is an optimal family of observables, resulting in 
T> = T = 1. Note that the GHZ state is permutation 
invariant, so the optimization over permutations does not 
play a role. 



C. Application to a four-photon experiment 

To study the noise tolerance of the quantities T and 
V and their performance for experimental data, we use 
the measurement results for the stabilizer correlations of 
the cluster state obtained by Kiesel et al. in a photonic 
experiment fl3| . When using all cluster stabilizing opera- 
tors for the discrimination (excluding the identity) , these 
data give 

^s £i (&x P ||GHZ 4 ) = 0.257± 0.014, (30) 
£ , s 54 G?ex P ||GHZ 4 ) =0.189 ±0.012. (31) 

When using only the three-point stabilizing operators, 
which form an optimal family Q, we get 

J c Q( (? exp||GHZ4) = 0.668 ±0.019, (32) 
£> Q (f?cxp||GHZ 4 ) = 0.353 ± 0.021. (33) 



Note that in all cases the observables are normalized with 
respect to the perfect cluster state as (C4|Ai|C4) = 1, 
while for the experimental data we have Tr(g exp Ai) < 1. 
Also, it should be noted that the subsets of observables 
we use were chosen to be optimal for the perfect cluster 
state but not necessarily for the experimental one. This, 
however, is similar to the implementation of entangle- 
ment witnesses in experiments: There, one typically con- 
siders some optimal witness for some pure state that one 
aims to prepare and applies it to the experimental data 
in order to obtain a significant entanglement test fill ]. 

To investigate the power of our discrimination meth- 
ods, we calculate T and V for both the perfect and the 
experimental cluster state under the influence of white 
noise. Figures [T] and [2] show T and T> as functions of the 
noise level. We make a number of observations: 

For the perfect cluster state with additional white 
noise, the quantity J- decreases with increasing noise 
level until it reaches zero at the noise level (1 — p) = 
J r (C4||GHZ4). This interpretation of J 7 as a noise toler- 
ance was already mentioned in Section III Al Note, how- 
ever, that the in the case of the experimental state the 
noise tolerance is no longer given by J-"(g C xp||GHZ 4 ) but 
is larger due to Ti(g exp Ai) < 1. 

For the same observables, the maximal noise level at 
which T> > is at least as high as the maximal noise 
level at which T > 0. This is a general feature: As a 
consequence of the positive definiteness of the relative 
entropy, D is nonzero whenever F is. In this particular 
example, T> is nonzero for noise levels arbitrarily close to 
1, though this is not a general feature. 

For the three-point stabilizing operators of IC4), also 
the noise tolerance of J 7 is 1 (Fig. |2J). It is instructive to 
compare this to the case of witness operators: The set 
of separable states contains a ball around the completely 
mixed state (24|, which implies that for any witness W 
and entangled state g detected by W the noise tolerance 
is strictly less than 1. In our case the situation is different: 
The reason for the noise tolerance of one is that no local 
unitaries of the GHZ state have any three-point correla- 
tions. This implies that the set of states LU equivalent 
to IGHZ4) does not contain a ball around 1/d. For a fair 
comparison, one may therefore consider the three-point 
correlations in experiments aiming at the generation of 
GHZ states (for instance, in Ref. [25[ they were maxi- 
mally 0.097) and ask whether the measured three-point 
stabilizer correlations in a cluster state experiment sig- 
nificantly exceed these values. 

Comparing the figures shows another difference be- 
tween the measures J- and T>: For the measure J 7 , the 
noise tolerance is higher in the case of the three-point 
stabilizing operators (Fig. [5]) than in the case of all stabi- 
lizing operators (Fig. [T]). This is remarkable because the 
former set of observables is contained in the latter. In 
other words, adding an observable can reduce the noise 
tolerance of T . From the definition of the quantity T> 
it is clear that its noise tolerance of T>a x , ...,A k is lower 
bounded by the noise tolerance of V for any subset of 
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FIG. 1: Discriminating the four-qubit linear cluster state with 
noise from all local unitaries of the GHZ state, using all sta- 
bilizing operators of the former. Shown are T (red circles) 
and T> (blue squares) versus the level of white noise (1 — p) 
for the perfect (empty symbols) and the experimental (filled 
symbols) state. 



Ai , . . . , A}. . In Ref . [26[ , a quantity similar to J- was con- 
structed from the correlations of an entangled state and 
used for entanglement detection. The same phenomenon 
of a decreasing noise tolerance when including more cor- 
relations was observed. 

The preceding observations concerning the comparison 
of the measures T and T> can be understood by noting 
that the relative entropy uses all information contained 
in the probability distributions for the measurement out- 
comes, whereas the quantity T effectively reduces each 
probability distribution to one parameter. 

We recall that the value of the relative entropy T> has 
an interpretation in terms of probabilities (Section III B|) . 
This is in contrast to the quantity J 7 , whose numerical 
value is fixed only by a normalization condition on the 
observables (Section III A[) . While in certain cases it can 
be interpreted as a noise tolerance and it is useful for 
comparing different observables, it gives no quantitative 
statement about the discrepancy between the prepared 
state and states which we want to exclude. Finally, we 
note that in experimental applications also the error es- 
timates for either quantity must be taken into account. 
Though we have done so in Eqs. (1501) -([55 ]) . a systematic 
analysis of this point is beyond the scope of our present 
article. 



IV. DISCRIMINATING THREE-QUBIT STATES 

Now we consider the three-qubit case, aiming at the 
discrimination of the three-qubit GHZ state and the 
three-qubit W state. These two states are relevant as rep- 
resentatives of the two different entanglement classes of 
genuine three-qubit entanglement Q. 
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FIG. 2: Discriminating the four-qubit linear cluster state with 
noise from all local unitaries of the GHZ state, using all three- 
point stabilizing operators of the former. Symbols as in Fig.[T] 

The three-qubit GHZ state is given by 

|GHZ 3 ) = -L(|000) + |lll». (34) 

As in the four-qubit case, this state can be described by 
its stabilizing operators. The stabilizer group is given by 
the eight observables 

Sghz 3 = {111, tZZ and perm., 

XXX, -XYY and perm.}. (35) 

The three-qubit W state 

|W 3 ) = -^(|001) + |010) + |100)) (36) 

is not a stabilizer state. If we expand its density matrix 
into Pauli matrices we arrive at 

|W a )(W 3 | = — [3 • 111 + + perm.) 

+2(1A"A" + perm.) + 2(1YY + perm.) - (tZZ + perm.) 

+2(XXZ + perm.) + 2(YYZ + perm.) - 3 ■ ZZZ] . 

(37) 

A. Discriminating the GHZ state from the W state 

Again, we will first compute JF and V for the case 
that all stabilizing operators (except for 1) of the GHZ 
state are used, and afterwards look for optimal families 
of observables. 

Parameterizing local unitaries as U((p, 9, ip) = 
exp(f0cr z /2) exp(i9a v /2) exp(i(^cr 2 /2), we obtain 
(W 3 |C/ f 1ZZU\W 3 ) = ihcos(6» 2 )cos(6> 3 ) + 2cos{ip 2 - 
ipz) sm(9 2 ) sin(6> 3 )]. The expectation values of Z1Z and 
ZZ1 can be obtained by cyclically permuting the indices 
of the angles, as the W state is permutationally invariant. 
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We find max LU (W 3 |lZZ|W 3 ) = 2/3, where the maxi- 
mum is attained when cos((p2 — f 3 ) sin^) sin(# 3 ) = 1. 
Thus the expectation values of \ZZ, Z\Z, and ZZ\ can 
be maximized simultaneously. We choose the solution 
<Pi = <f2 = <P3 = and 9\ = 8 2 = 3 = tt/2. 

Let us now consider the remaining stabilizing opera- 
tors. Assuming the above choice for the angles tfi and Oi 
and making the symmetry assumption ipi = -02 = 03 = 
0, we have (W 3 \W XXX U\W 3 ) = 3cos 3 (0) - 2 008(0) 
and (W 3 \W(-XYY)U\W 3 ) = 3cos 3 (0) - | cos(0) and 
finally 



Dxxx,-xyy. (GHZ 3 ||[/|W 3 )) 

-YXY-YYX 

= - log [- (l + 3z 3 - 2z) (l + 3z 3 



7 ,3 



where z = cos(0). This function is minimal at z = 
In conclusion, we found that 



1 



13 



(38) 
-1/2. 

43, 



(GHZ 3 ||W 3 ) = -(-3 log --log — -31og — 
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0.2235 



G 



1G 



(39) 



1 and (W 3 \U^{-XYY)U\W 3 ) = 2/3. This proves our 
claim. 

One might ask if these optimal families of observables 
can be used for the construction of a witness operator 
for the GHZ entanglement class. This is, however, not 
the case, as 

max (W 3 \{1ZZ + ZtZ + ZZt)\W 3 ) = 3 (43) 

SLOCC of W3 

holds, which can be seen from the fact that the fully 
separable state |000) gives this value. 

We summarize the main results of this subsection: 

Observation. // all GHZ stabilizing operators except the 
identity are used for the discrimination of the GHZ state 
from all L U equivalents of the W state, the measures T> 
and F are given by Eqs. (|39[) and (|40[) . // we consider 
subsets of the stabilizer group, any combination of 1ZZ , 
Z1Z, andZZl is an optimal family of observables, yield- 
ing F — 1/3 and T> = — log(5 /6) . 



B. Discriminating the W state from the GHZ state 



While our analytical calculation required some symme- 
try assumptions, this value is also obtained by numerical 
minimization over all Euler angles. 

For the fidelity-based measure F we note that 
the rotation we found when minimizing T> gives 
|(GHZ 3 |C/|W 3 )| 2 = 3/4. This is known to be the high- 
est possible overlap of the GHZ state with local unitarics 
(or, indeed, SLOCC) of the W state '271- With Eq. (JUJ) 
we obtain 



7s. HZ3 (GHZ 3 ||W 3 ) = -. 



(40) 



In Rcf. 27] the state 



|W 3 ) 



2y/6 



(3,-1,-1,-1,-1,-1,-1,3) (41) 



was found to maximize the overlap of |GHZ 3 ) with 
the SLOCC class of the W state. This state is in 
fact LU equivalent to |W 3 ). Though |W 3 ) minimizes 
F, it does not minimize D, as it gives the value 
D S'a» Z3 (GHZ 3 ||W 3 ) = -f log § « 0.2255. 

We want to find families of observables that give the 
highest values of F and V. We claim that any combina- 
tion of 



For the reverse problem, the discrimination of |W 3 ) 
from the local unitaries of |GHZ 3 ), there is no obvious 
choice of observables, as the W state is not a stabilizer 
state. However, each of the observables 



HZ. IZt. Ztt 



(44) 



has an expectation value of 1/3 for the W state and zero 
expectation value for all local unitaries of IGHZ3), as 
for the GHZ state all reduced one-qubit density matrices 
are maximally mixed. Thus any appropriately normalized 
combination of the above observables gives F = 1, which 
is the optimal value. 

All of these combinations give 



2?(W 3 ||GHZ 3 ) = ^log^-ilog^ 



0.0817. 



(45) 



This is the best possible value among all operators oc- 
curring in the decomposition of |W 3 ) in Eq. ([37|l . To see 
this, we choose the rotation ip = 2ir/3, 6 = 3n/2, and 
ijj = 57r/4 on all qubits and observe that all these opera- 
tors give a value of D less or equal to the one in Eq. P5l) . 



GENERAL GRAPH STATES 



tzz, ztz, zzt 



(42) 



is optimal in this sense among all GHZ stabilizing oper- 
ators. As we have seen above, any element of this family 
gives F — 1/3 and V = — log(5/6). We only have to show 
that for any other stabilizing operator, or combination 
of stabilizing operators, there exists a local unitary such 
that F < 1/3 and D < — log(5/6). Taking in the above 
calculations cos(^) = 1 we have (W 3 \W XXX U\W 3 ) = 



In the previous sections we observed that the stabi- 
lizing operators of a given state are natural candidates 
for observables that discriminate this state from other 
states. The graph states are a large class of states which 
are unambiguously described by their stabilizing oper- 
ators [22]. In this section we will derive some general 
statements about the discrimination of graph states. 

A graph state is described by its graph, that is, a col- 
lection of vertices, each standing for a qubit, and edges 
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(a) (b) (c) 




(d) (e) 



FIG. 3: Graphs of the graph states appearing in the text, 
(a) Four-qubit GHZ state, (b) four-qubit linear cluster state, 
(c) three-qubit GHZ state, (d) and (e) the graph states dis- 
cussed at the end of Section[V] In (d) and (e) dashed red lines 
denote the presence of two-point correlations. 

connecting vertices. Some graphs are depicted in Fig. [3J 
With each vertex (or qubit) i we associate the correlation 
operator Ki defined as the observable that acts as X on 
the vertex i and as Z on every vertex in the neighborhood 
N(i) of i, i.e., all vertices connected to i by an edge 

Ki = Xi Yl Zj. (46) 

The graph state |G) is defined as the unique common 
eigenstate of all Ki with eigenvalue +1. The products of 
the Ki form a commutative group S, and the graph state 
is an eigenstate of all elements of this group. The group 
S is called the stabilizer group of |G) and its elements 
the stabilizing operators. For an n-qubit state, that sta- 
bilizer group has 2 n elements. The sum of all stabilizing 
operators Si £ S gives the projector onto the graph state 

l G >< G l = ^l>' (47) 

i=l 

and Eq. (II T[) is an example of this general property. The 
stabilizing operators thus contain a description of all cor- 
relations present in the state. 

Alternatively from describing perfect correlations, the 
graph can be understood as a description of the inter- 
actions that created the state [22]: Every edge corre- 
sponds to an Ising-type interaction and the state is ob- 
tained by applying these interactions to the product state 
|+)|+) • • • |+). In addition, it should be noted that it is 
possible for two different graphs to lead to states which 
are equivalent under local unitary transformations and 
hence have the same entanglement properties [22j . 

As an example, we consider the graph in Fig. [3] (a). 
The generating operators of the stabilizer group are 

K 1 = XZZZ. K 2 = ZXlt, K 3 = ZtXt, K± = ZttX. 

(48) 



After a local change of bases we recognize the graph state 
as the four-qubit GHZ state with the complete stabilizer 
group given in Eq. (fTB]). The four-qubit linear cluster 
state and the three-qubit GHZ state are given by the 
graphs in Figs. [3J (b) and [3] (c). 

In our discussion on the discrimination of the GHZ 
from the cluster state we learned that the optimal fam- 
ilies of observables consist of two-point stabilizing oper- 
ators. The reason for their optimality is that the cluster 
state has fewer two-point stabilizing operators than the 
GHZ state. Hence one may try do derive general results 
depending only on the numbers of two-point (or higher- 
order) stabilizing operators. 

The number of two-point correlations of a graph state 
can easily be obtained from its graph 

cam 

. Restricting 

ourselves to connected graphs with three or more vertices, 
there are three possibilities to obtain two-point stabiliz- 
ing operators: 

1 . Vertices connected to the rest of the graph by only 
one edge. The generator associated to such a vertex 
is a two-point operator of the form XZ. 

2. Pairs of unconnected vertices whose neighborhoods 
are equal: N(i) = N(j). The product of the two 
generators associated to such a pair has the form 
XX. 

3. Pairs of connected vertices for which N(i) U {i} = 
N(j) U {j}. This means that their neighborhoods 
apart from i and j are the same. The product of 
their generators has the form YY. 

The product of the generators associated to vertices i 
and j is never equal to the identity at positions i and j. 
Therefore it is a two-point stabilizing operator if and only 
if it is equal to the identity at all other positions, which 
leaves only the possibilities [2] and [3] For the same reason 
the product of three or more generators is never a two- 
point stabilizing operator. This shows that the above list 
exhausts all possibilities to obtain a two-point stabilizing 
operator. 

Now, let |Gi) and \G 2 ) be two graph states, k\ and k 2 
the numbers of two-point correlations of these states, and 
let Pq x be the set of two-point stabilizing operators of 
|Gi). We assume k\ > k 2 (our result will be trivial other- 
wise) . We can then derive a lower bound on J r p Gl (Gi \\ G 2 ) 
and 'Dp Gi (G1HG2) that depends only on the numbers k\ 
and k 2 . Namely, we have 

J 7 p Gl (Gi\\G 2 ) > ~^~~j~ L i (49) 

£>p Gl (Gi||G 2 ) > (50) 

To see this, note that from the above discussion it is 
clear that all two-point stabilizing operators of the same 
graph state act on different pairs of qubits 28] . It follows 
that any two-point stabilizing operator of \G 2 ) can have 
nonzero overlap with at most one stabilizing operator of 
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\Gi). This shows that at least k\—k 2 two-point stabilizing 
operators of \G\) will have zero overlap with \G 2 ), and 
thus give J- = T) = 1. Note that this still holds if local 
unitaries and permutations of qubits are considered. 

For |G? X > = |GHZ 4 ) and \G 2 ) = |C 4 ) the bounds give 
the exact results. This is, however, not always the case, 
as the example of Figs. [3] (d) and [3] (e) shows: Here, 
k± = 4 and k 2 — 3. Of the two-point correlations of 
Id) , three connect three qubits in a triangle, while for 
IG2) the connected pairs are all disjoint. This shows that 
at most two of the two-point stabilizing operators of 
|Gi) can have nonzero overlap with \G 2 ). In this ex- 
ample, T P ^ (G X ||G 2 ) = V P ^{G X \\G 2 ) = 1/2 > 1/4 = 
(ki - k 2 )/ki. 

While one can also use higher-order stabilizing oper- 
ators for the discrimination, it is more difficult to de- 
rive general results for them. Nevertheless, for two given 
graph states the number of three-point (or higher-order) 
correlations can directly be computed by writing down 
the whole stabilizer group; one may then compare the 
different numbers of higher order stabilizing operators. 



VI. DISCUSSION AND CONCLUSION 

In this article, we have developed strategies for showing 
that an experimentally prepared state is not in a certain 
class of undesired states, given by all local unitaries, or 
all local unitaries and permutations of qubits, of another 
state. 

We introduced two measures for the discrimination 
strength of observables. The first measure, denoted by 
J 7 , was based on the difference of the expectation values 
of the observable on either state. It could be interpreted 
as a noise tolerance, similar to the case of witness op- 
erators. The other measure, denoted by T>, was defined 
as the relative entropy of the probability distributions 
for the measurement outcomes on either state. It gives 
a quantitative answer to the discrimination problem by 
allowing to compare the likelihood that the experimental 
data origin from the undesired state with, e.g., the like- 
lihood that a sequence of highly biased coin tosses can 
be explained by a fair coin. In particular, the measure T> 
is defined in such a way that a maximization of it corre- 
sponds to maximizing the discrimination strength while 
keeping the total number of measurement runs constant, 
thus answering the question for the best discrimination 
with limited experimental resources. 

We discussed in detail the discrimination of the four- 
qubit GHZ state from the four-qubit linear cluster state, 
and vice versa, using stabilizer observables. We also dis- 
cussed the three-qubit GHZ state and the W state, thus 
showing that our method can be generalized to states 
that are no graph states. Finally, we derived a general 
result on the discrimination of two graph states with the 
two-point stabilizing operators of one of them, using the 
power of the graph formalism. 

For a specific example we studied the noise tolerance 



of the quantities J 7 and T> and their performance for ex- 
perimental data. Concerning the comparison of the two 
measures, our observation that the measure T> is more 
robust against noise than T could be explained by the 
fact that the relative entropy uses all information con- 
tained in the probability distributions, while the mea- 
sure JF effectively reduces each probability distribution 
to one parameter (namely, the sum of the expectation 
values). Thus, our conclusion is as follows: Either mea- 
sure can be used to compare the suitability of different 
observables for a given discrimination task. For the eval- 
uation of experimental results the relative entropy T> is 
to be preferred, since it uses all available information and 
allows for a clear statistical interpretation. 

There are several interesting open questions and possi- 
ble generalizations for the future: First, one could extend 
our analysis to the measurement of nondichotomic ob- 
servables or arbitrary product bases [see our discussion 
of Eq. (|27pj. Second, one could consider SLOCC equiva- 
lence classes instead of LU classes. Our measures T and 
T> are equally applicable for that case, only the optimiza- 
tion is different. Third, one could connect our results to 
other methods for characterizing multipartite entangle- 
ment classes. For instance, there exist witnesses distin- 
guishing the class of mixed three-qubit GHZ states from 
the class of mixed W states [13] ■ Here, mixed W states are 
those three-qubit states that can be written as a mixture 
of pure states that are biseparable or SLOCC equiva- 
lent to the W state. The discrimination of such classes of 
mixed states is a different problem than the one consid- 
ered in this article; nevertheless, it would be interesting 
to understand possible connections. 
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Appendix A: Properties of the relative entropy 

The relative entropy, also called Kullback-Leibler 
divergence, from the probability distribution P = 
{pi, . . . ,p m } to the probability distribution Q = 
{qi, . . . , q m } is defined as (l8j 

TCI 

D(P\\Q) = Y,P* 1 °Z(-)- ( A1 ) 

i=i q% 

We use the logarithm to the base of two, log = log 2 , 
and define 01og(0) = (more explicitly: Olog(^) = 0, 

pl°g(o) = 00 ' an d Ql°g(§) = 0)- We list without proof 
the properties of the relative entropy that are needed in 
this article: 
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1. Positive defmiteness: D(P\\Q) > with equality if 
and only if P = Q. 

2. D(P\\Q) = oo if and only if = 0, but p l > for 
some i, that is, if an event is impossible according 
to the probability distribution Q, but occurs with 
a nonvanishing probability according to P. 

3. D(P\\Q) ^ D(Q\\P) in general, so D is not a met- 
ric. 

4. Grouping rule: If wc decide to consider events 1 
and 2 as one event, leaving us with probability 
distributions P = {p\ + p^,P3, ■ ■ ■ ,Pm} and Q = 
{Qi + q2,q3,---,Qm}, then 

D(P\\Q) = D(P\\Q) + (p! +p 2 )D(P'\\Q'), (A2) 



where P' = {pi/(pi +Pi)iPij{p\ + Pi)} and Q' = 
{<Zi/(</i + «fe),«2/(«?i + Q2)}- Note that D{P'\\Q') 
is the relative entropy within the "box" containing 
events 1 and 2, and (pi + P2) is the P probability 
of obtaining an event from this box. 



5. If P and Q are the joint probability distributions for 
two independent random variables, Pij — p^pf^ 
and qij = qf\f \ then 

D(P\\Q) = D(P^\\Q^) + D(PW\\Q<V). (A3) 
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